Indexing Techniques for Temporal Text Containment Queries

نویسنده

  • Sharath Srinivas
چکیده

Many information management systems maintain multiple time stamped versions of documents. The archives of web pages, version control systems, wikis and backup mechanisms are examples of such systems. For such temporally versioned document collections, a search using keywords along the temporal dimension is valuable. This paper studies the temporal dimension of keyword search in the context of text document collections. The inverted index, which is an integral part of keyword based IR technique, requires several extensions for it to support keyword search over temporal document collections. We propose a number of techniques that explore such extensions. Several experimental results are also presented to compare the proposed solutions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Space-Efficiency in Temporal Text-Indexing

Support for temporal text-containment queries is of interest in a number of contexts. In previous papers we have presented two approaches to temporal text-indexing, the V2X and ITTX indexes. In this paper, we first present improvements to the previous techniques. We then perform a study of the space usage of the indexing approaches based on both analytical models and results from indexing tempo...

متن کامل

Norwegian University of Science and Technology Technical report IDI-TR-11/2002 Supporting Temporal Text-Containment Queries

In temporal document databases and temporal XML databases, temporal text-containment queries are a potential performance bottleneck. In this paper we describe how to manage documents and index structures in such databases in way that makes temporal text-containment querying feasible. We describe and discuss different index structures that can improve such queries. Three of the alternatives have...

متن کامل

IDI - TR - 11 / 2002 Supporting Temporal Text - Containment Queries

In temporal document databases and temporal XML databases, temporal text-containment queries are a potential performance bottleneck. In this paper we describe how to manage documents and index structures in such databases in way that makes temporal text-containment querying feasible. We describe and discuss different index structures that can improve such queries. Three of the alternatives have...

متن کامل

Supporting temporal text-containment queries in temporal document databases

In temporal document databases and temporal XML databases, temporal text-containment queries are a potential performance bottleneck. In this paper we describe how to manage documents and index structures in such databases in a way that makes temporal textcontainment querying feasible. We describe and discuss different index structures that can improve such queries. Three of the alternatives hav...

متن کامل

Demo of SemIndex: Semantic-Aware Inverted Index on Text

Processing keyword-based queries is a central problem in Information Retrieval (IR), where several studies have been done to develop effective keyword-based search techniques [1, 2]. A standard containment keyword-based query, which retrieves textual identities that contain a set of keywords, is generally supported by a full-text index. The inverted index is considered as one of the most useful...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008